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Method For Low-Energy Adaptive Clustering Hierarchy 

STATEMENTS REGARDING FEDERALLY SPONSORED RESEARCH 
5 Not applicable. 

CROSS-REFERENCE TO RELATED APPLICATIONS 
Not applicable. 

1 0 FIELD OF THE INVENTION 

The present invention relates to data communications networks. More specifically, 
the present invention relates to energy efficient wireless networks. 

BACKGROUND OF THE INVENTION 

15 As is known in the art, a distributed microsensor network is a network in which 

multiple, small, inexpensive, easy to handle sensors, interfaced with microprocessors, are 
deployed and distributed in a region for monitoring and control purposes. The 
microprocessors can transfer data collected by the sensors along with network control 
information among the microprocessors themselves or to a central base station via 

20 communication paths. 

In some instances the microprocessors and attached sensors, collectively referred 
to as microsensor nodes or more simply nodes, are physically separated from each other 
but are coupled via a wireless network to provide a wireless distributed microsensor 
25 network. Using wireless communication between the nodes eliminates the need for a 
fixed communications infrastructure. 

Each microsensor node includes a microprocessor, associated microsensor, power 
source and control, and a communications interface. The communications interface can 
30 be a radio frequency (RF) transmitter and receiver in wireless applications. In addition to 
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the nodes themselves being relatively inexpensive, deployment of wireless microsensor 
node networks is relatively inexpensive compared with conventional networks which 
utilize relatively expensive macrosensors which are directly wired to a central controller. 

5 These microsensor networks are fauU-tolerant, due to the sheer number of nodes 

which can ensure that there is enough redundancy in data acquisition even if not all nodes 
are functional. A limitation on the fault-tolerant property is that connectivity between all 
remaining nodes and a central base station must be maintained when some nodes fail or 
run out of energy, 

10 

Such wireless distributed microsensor networks are used to monitor a variety of 
environments for both civil and military applications. For example, for a security 
system, acoustic, seismic, and video sensors can be used to form an ad hoc wireless 
network to detect intrusions, Microsensors can also be used to monitor machines for 
1 5 fault detection and diagnosis . 

Communication protocols, in such wireless distributed networks can have 
significant impact on the overall energy dissipation of these networks. Ideally, network 
protocols provide fault tolerance in the presence of individual node failure while 
20 minimizing energy consumption. 

Eventually, the data being sensed by the nodes in the network must be 
transmitted to the central base station, where the end-user can access the data. One 
problem with wireless microsensor node networks, however, is that channel bandwidth 
25 is a limited network resource which must be shared among all the sensors in the 

network. Thus, it is desirable to provide routing protocols for these networks which 
reduce bandwidth requirements for data transmission 

There are many possible models for wireless microsensor node networks. For 
30 example, some microsensor networks include a fixed base station and distributed sensors 
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located relatively far from the base station. Generally increased distance from the base 
station requires increased RF energy to be expended by a node to communicate with the 
base station. In such networks, the nodes in the network typically are homogeneous and 
energy-constrained. One problem with homogeneous and energy constrained networks is 
5 that communication between the sensor nodes and the base station is relatively expensive 
in terms of energy consumption. 

To overcome this problem, some systems focus on energy-optimized solutions at 
all levels of the network hierarchy, from the physical layer and communication protocols 
10 up to the application layer and efficient DSP design for microsensor nodes. These 
approaches, however, are sometimes relatively expensive and complex to implement. 

There have been several network routing protocols proposed for wireless 
networks. In one approach referred to as a direct communication protocol approach, each 

1 5 sensor sends its data directly to the base station. One problem with this approach, 
however, is that if the base station is far away from the nodes, direct communication 
between the base station and the nodes requires a relatively large amount of transmit 
power from each node. The need for a relatively large amount of transmit power quickly 
drains the node battery and thereby reduces the system lifetime. Another problem with 

20 direct communication protocol approaches is that sensor networks contain too much data 
for transmission. Also, the sensor networks contain more data than can be efficiently 
processed by an end-user. Therefore, automated methods of combining or aggregating 
the data into a small set of meaningful information is required. 

25 A second approach is a so-called "minimum-energy" routing protocol In 

networks using minimum-energy protocols, nodes route data destined ultimately for the 
base station through intermediate nodes. Thus, nodes act as routers for other nodes' data 
in addition to sensing the environment and transmitting locally collected data. 
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One problem with this approach is that the router nodes can quickly run out of 
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power. There are some minimum energy protocols which only consider the energy of the 
transmitter and neglect the energy dissipation of the receivers in determining the routes. 
In such protocols, the intermediate nodes are chosen such that the transmit amplifier 
energy (thus node energy) is minimized. However, for this minimum-transmission- 
5 energy (MTE) routing protocol, rather than just one (relatively high-energy) transmission 
of the data, each data message must go through n (low-energy) transmissions and n 
receptions. Thus, depending on the relative costs of the transmit amplifier and the radio 
electronics, the total energy expended in the network might actually be greater using 
MTE routing than direct transmission to the base station. 

10 

In MTE routing, the nodes closest to the base station are used to route a large number 
of data messages to the base station. Thus these nodes will die out quickly, causing the 
energy required to get the remaining data to the base station to increase and more nodes to 
die. This will create a cascading effect that will shorten system lifetime. In addition, as 
1 5 nodes close to the base station die, that area of the environment is no longer being monitored. 
Conventional approaches to routing such as MTE contain these drawbacks when the nodes 
are all energy-constrained. 

When transmission energy is on the same order as reception energy, which occurs 
20 when transmission distance is short and/or the radio electronics energy is high, direct 

transmission is more energy-efficient on a global scale than MTE routing. Thus the most 
energy-efficient protocol to use in any particular application depends upon the network 
topology and radio parameters of the network. 

25 It would, therefore, be desirable to provide a network communication protocol 

that minimizes energy dissipation in sensor networks. It would also be desirable to 
evenly distribute the energy load among the sensor nodes in the network. It would further 
be desirable to reduce the amount of information that must be transmitted to the base 
station and increase the use of the communications bandwidth. It would be still further 

30 desirable to provide a wireless network having many microsensor nodes and a prolonged 
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system life when the nodes are energy-constrained. 

SUMMARY OF THE INVENTION 
5 In view of the above problems and limitations of existing distributed sensor 

networks and protocols and in accordance with the present invention, the importance of 
balancing the node energy load, and reducing the data transmitted when the nodes are 
energy-constrained has been recognized. It would therefore be desirable to provide a 
network having a clustering-based protocol which utilizes randomized rotation of local 
1 0 cluster-heads, localized coordination to enable scalability and robustness for dynamic 
networks, and the incorporation of data fusion into the routing protocol to reduce the 
amount of information that must be transmitted to a base station. 

In accordance with an aspect of the present invention, a network includes a base 
1 5 station and a plurality of nodes. Each of the nodes has a low energy mode, and a high 
energy mode and are organized into node clusters. Each node cluster includes a 
designated cluster-head. Each of the nodes in the cluster collects information and 
transmits the information to the cluster-head. The cluster-head then transmits the 
information collected by the nodes in the cluster to the base station. The network 
20 further comprises means for selecting a new cluster-head and means for forming new 
clusters about the new cluster-heads. With such an arrangement, the network achieves 
energy-efficiency by (i) randomized, adaptive, self-configuring cluster formation, (ii) 
localized control for data transfers, and (iii) low-energy medium access. By providing 
for the selection of new cluster-heads, the energy load is balanced in order to prolong 
25 the overall system life. Moreover, all the nodes in the network can share the limited 
channel bandwidth by using local low energy transmissions separated spatially from one 
another in order to avoid interference. The nodes can also enter a sleep mode to 
preserve energy. The nodes can additionally include attached microsensors or control 
interfaces. 
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In accordance with a further aspect of the present invention a method of forming a 
network from a pluraUty of nodes includes the steps of (a) forming the nodes into a 
plurality of clusters for a round of data transmission, each of the plurality of clusters 
containing a cluster-head; (b) operating the cluster to transfer data; (c) selecting new 
5 cluster-heads; and (d) repeating steps (a)-(c) for a subsequent round of data transmission. 
With such an implementation, an energy efficient protocol is provided. The cluster-heads 
schedule data transfers to minimize collisions and maximize sleep time of the other nodes 
in the cluster, thereby reducing energy dissipation. Furthermore, the cluster formation is 
self configuring and fault tolerant. 

10 

In accordance with a still further aspect of the present invention a method of 
cluster operation includes the steps of collecting data in a cluster-head and reducing the 
data transmission latency by using application-specific data aggregation to reduce the 
amount of redundant data transmitted from the cluster-head to a base station. Local data 

1 5 aggregation at the cluster-heads greatly reduces the amount of data that needs to be sent 
to the base station. An application-specific protocol architecture achieves the energy- and 
latency-efficiency needed for wireless microsensor networks. In addition to helping 
avoid information overload, data aggregation, also known as data fusion, can combine 
several unreliable correlated data measurements to produce a more accurate signal by 

20 enhancing common signals and reducing uncorrelated noise. Since the method is 
application specific and can achieve a greater level of redundant data reduction, the 
method is energy- and latency-efficient. Thus, application-specific data aggregation can 
also increase the signal to noise ratio of the data sent to the base station. The 
classification or higher level processing of aggregated data can be performed manually 

25 (e.g., with the aid of a human operator) or automatically. In one embodiment, the method 
of performing data aggregation and the classification algorithm are application-specific. 
For example, acoustic signals are often combined using a beamforming algoritlim to 
reduce several signals into a single signal that contains the relevant information of all the 
individual signals. Large energy gains can be achieved by performing the data fusion or 
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classification algorithm locally, thereby requiring much less data to be transmitted to the 
base station. 

The benefits of the present techniques and network topology include localized 
5 coordination and control for cluster setup and operation, randomized rotation of the 
cluster-heads and formation of the corresponding clusters, and local data aggregation to 
reduce global communication. The use of clusters for transmitting data to the base station 
leverages the advantages of short transmit distances for most nodes, requiring only a few 
nodes to transmit over far distances to the base station. The Low-Energy Adaptive 

1 0 Clustering Hierarchy (LEACH) technique of the present invention outperforms classical 
clustering algorithms by allowing the energy requirements of the network to be 
distributed among all the sensors. In addition, LEACH is able to perform local 
computation in each cluster to reduce the amount of data that must be transmitted to the 
base station. This achieves a large reduction in the energy dissipation, because 

1 5 computation is much less expensive than RF communication. 

In accordance with a still further aspect of the present invention, it is possible to 
form the clusters by collecting data on the status of each of the plurality of nodes and 
assigning each of the plurality of nodes to a particular cluster. With such an alternative 
20 cluster formation method, the base station can form clusters, which will generally be 
more efficient than those formed using a distributed algorithm as described above. 
However, the improved cluster formation is at the expense of requiring that each node 
transmit information to the base station at the beginning of each round about its location. 

25 Although the inventive teachings are disclosed with respect to wireless network 

applications, the present teachings may be used for other applications (e.g., non-wireless 
networks) as will be appreciated by those skilled in the art. Likewise, the type of sensor 
associated with the microprocessor is not limited to an acoustic or seismic sensor but 
could be an image sensor or any sensor as is know in the art which can be controlled by a 

30 microprocessor. Additionally the microprocessors in the inventive network could operate 
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as a node with or without a sensor and could include a control module for operating 
external devices. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 The foregoing features of this invention, as well as the invention itself, may be more 

fully understood from the following description of the drawings in which: 

FIG. 1 A is a block diagram representing a network having nodes and a base 

station; 

FIG. IB is a block diagram representing a network having nodes organized into 
10 clusters with cluster-heads communicating with a base station according to the present 
invention; 

FIG. 2 is a block diagram representing a typical wireless node; 
FIG. 3 is a timing diagram of setup and steady state operation according to the 
present invention; 

1 5 FIG. 4 is a flow diagram of a method of forming a distributed cluster according to 

the present invention; 

FIG. 5 A is a diagram of the result of dynamic cluster formation prior to a round of 
operation of the low-energy adaptive clustering hierarchy according to the present 
invention; 

20 FIG. 5B is a diagram of the result of dynamic cluster formation in a round of 

operation subsequent to the round shown in FIG. 5A of the low-energy adaptive 

clustering hierarchy according to the present invention; 

FIG. 6 is a flow diagram illustrating steady state operation of the low energy 

adaptive clustering network; 
25 FIG. 7 is a timing diagram of data transmission according to the present invention; 

and 

FIGS. 8A, 8B, and 8C are diagrams of the correlation among data sensed by 
nodes according to the present invention. 
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DETAILED DESCRIPTION OF THE INVENTION 
Before describing the system, some concepts are identified and terminology is 
explained. As used herein, the term "network" or "system" refers to a plurality of nodes 
which communicate with each other and with one node which may be designated as a 
5 "base station". 

In an effort to promote clarity in the text, reference is sometimes made herein to a 
particular type of network (e.g., a wireless network) or a particular network configuration. 
Such reference should not be taken as limiting the invention to a particular type of 
1 0 network or to a specific network configuration. The present invention finds application in 
any network in which the nodes are energy constrained. 

Reference is also made herein to certain ones of the network nodes being part of a 
"cluster". Each cluster is made of one node corresponding to a cluster-head and may 

1 5 contain additional nodes corresponding to non-cluster-head nodes. Each of the cluster- 
heads and non-cluster heads perform certain functions to be described below. Those of 
ordinary skill in the art should appreciate, however, that during the operation of a network 
organized using the inventive hierarchy, a node can at any given time be a cluster-head in 
one cluster and can be a non-cluster-head node in a different cluster at a subsequent point 

20 in time. It should thus be appreciated that references made hereinbelow to a particular 
node corresponding to a cluster-head are made only for ease of explanation and, in 
accordance with the invention, any node can be either a cluster-head or a non-cluster 
head. Furthermore, any non-cluster head can elect itself to be a cluster head. 

25 It should also be understood that in some of the examples given below the nodes 

are identical or homogenous but that this need not be so. The adaptive cluster formation 
technique of the present invention, including the random selection of cluster-heads, can 
be used regardless of whether the nodes are homogenous. Furthermore, the techniques of 
the present invention are not affected by an unequal node energy distribution which can 

30 occur in some applications. The nodes described below are not required to be either fixed 
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or mobile. Likewise, as mentioned above, the present invention finds application in a 
wide variety of different network configurations and applications including but not 
limited to environmental monitoring, communications networks, image capture and 
seismic monitoring networks. 

5 

Referring now to FIG. lA, a network 100 includes a base station 102, and a 
plurality of nodes 1 10a- 1 10m generally denoted 110. All of the nodes 1 10 may have the 
capability to communicate with the base station 102. 

1 0 Referring now to FIG. 1 B, in which like elements of FIG. 1 A are provided having 

like reference designations. Once the nodes 1 10 are deployed in their environment, the 
nodes are organized into a plurality of individual clusters 1 12a-l 12k, generally denoted 
112. The number of clusters 1 12 generally expected for any particular application is 
preferably a configuration parameter. 

15 

The base station 102 can communicate with the plurality of clusters 112. Taking 
cluster 1 12a as representative of clusters 112, cluster 1 12a is made up of nodes 1 10a- 

I lOf Node 1 lOd corresponds to a cluster-head and is denoted llOd,,,. The base station 
102, communicates with cluster 1 12a via transmissions to and from cluster-head 1 lOd,,,. 

20 The cluster 112a thus includes a plurality of non-cluster-head nodes 1 1 Oa- 1 1 Oc, 1 1 Oe and 

I I Of and cluster-head 1 lOd,,,. 

The network 100 of the present invention implements a Low-Energy Adaptive 
Clustering Hierarchy (LEACH) protocol. LEACH is an application-specific protocol 

25 architecture for wireless microsensor network 10 which provides randomized, adaptive, 
self-configuring cluster formation; localized control for data transfers; low-energy 
medium access; and application-specific data aggregation to reduce the amount of data 
sent to the user. A clustering protocol has several advantages in terms of energy- and 
bandwidth-efficiency. During cluster formation, the nodes 110 organize themselves into 

3 0 local clusters 1 1 2a- 1 1 2c generally denoted 1 1 2 and a subset of the nodes 1 1 0 act as so- 
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called cluster-heads. In FIG, IB, nodes 1 lOd, 1 lOh and 1 10m correspond to the cluster- 
heads, generally denoted 1 lO^j^^. Using a clustering approach, all non-cluster-head nodes, 
generally denoted 1 1 0;^^,,.,|,, in clusters 1 12a- 1 12k only need to transmit their data a short 
distance, i.e., to the respective cluster-heads 1 lOd.i, 1 lOh^h and 1 lOm.i, Furthermore, only 
5 the cluster-heads 1 lOd.i, 1 lOh.^ and 1 10m,i, need to receive data, which is an energy- 
intensive operation. Clustering can therefore reduce energy consumption for most of the 
nodes 1 10 in the network. 

In addition, the cluster-head 1 lOd^^, can coordinate the transmissions of the nodes 
10 1 1 Oa-1 1 Oc, 1 1 Oe and 1 1 Of in the cluster 1 1 2a. This reduces the number of times two 
transmissions interfere with each other (a collision) and also reduces the amount of time 
nodes 1 lOa-1 lOc, 1 lOe and 1 lOf need to be awake to transmit data. 

Furthermore, clusters 1 12a-l 12k can automatically facilitate spatial reuse of the 
15 bandwidth by reducing transmission energy so that nodes 110 having sufficient spatial 
separation can use the same transmission frequencies because these separate low-energy 
transmissions will not interfere with each other. Finally, if the data collected by nodes 
1 1 0 within each cluster 1 1 2a-l 1 2k are correlated, the cluster-heads 1 1 Od.^, ^ 1 1 Oh,i, and 
1 10m,i^can perform local data aggregation within each cluster 1 12a-l 12k as will be 
20 explained below. 

If the cluster-heads 110,,,, were chosen a priori and fixed throughout the system 
lifetime, as in conventional clustering algorithms, the cluster-heads llO.i,, would die 
quickly, ending the useful lifetime of all nodes 110 belonging to clusters 1 12a- 1 12c 
25 because there would be no way to communicate with the base station 1 02. 

In accordance with the present invention, however, any node 1 10 in the network 
100 can serve as a cluster-head at some point in time. In one embodiment, a randomized 
rotation of the high-energy cluster-heads 1 lO.^s is used such that functions provided by 
30 the cluster-head rotate amongst the nodes 1 10 in order to avoid draining the power source 
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of any one node 1 10 in the network 100. 

One particular technique for selecting cluster-heads and forming clusters is 
described below in conjunction with FIGS. 3 - 8C. In general, once the nodes 110 are 

5 deployed in the environment of interest, certain ones of the nodes elect themselves as 
cluster-heads. In the example of FIG. 1 A, nodes 1 lOd, 1 lOh and 1 10m have elected 
themselves as cluster-heads. Next, the remaining nodes, (i.e., the non-cluster-head nodes 
1 1 0No„.chs) 1 1 Oa- 1 1 Oc, 1 1 Oe- 1 1 Og, 1 1 Oi- 1 1 Oj determine which cluster-head they will 
communicate with. The clusters 1 12a, 1 12b and 1 12c are formed once each non-cluster- 

1 0 head nodes 1 1 O^^^..^^ selects a cluster-head 1 1 0,,, with which to communicate. 

Cluster 1 12a', including nodes 1 lOb, 1 lOd, 1 lOe, 1 lOi and 1 10k with node 1 lOe',, 
operating as the cluster head, represents a cluster formed subsequent to initial cluster 
formation. In cluster 1 12a', node 1 lOd is no longer operating as the cluster head 1 10 
1 5 Likewise, cluster 1 1 2b', including nodes 1 1 Oa, 1 1 Oc, 1 1 Of, 1 1 Og, and 1 1 Oh and 1 1 Oj with 
node llOc',1, operating as the cluster head is a subsequently formed cluster 112. 

One possible node 1 10 architecture is shown in FIG. 2, It should be appreciated 
that nodes 1 10 can use commercially available components. In this particular example, a 

20 node 110 includes a microprocessor 128 such as a StrongARM (SA-1 100) 

microprocessor running a lean version of the RedHat eCos operating system for 
implementation of digital signal processing (DSP) algorithms, powered from a DC/DC 
Power Converter 124 connected to a power source 122. The power source 122 can be 
provided, for example, as a battery. It should be appreciated that other power sources 

25 such as solar power or self-powered sources (e.g. power derived from machine 
vibrations) can power the nodes 110. 
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The microprocessor 128 controls a sensor 126, an analog to digital converter 
(A/D) 127 and an RF transceiver module 129. The RF transceiver module 129 includes a 
signal strength processor 123. The node can contain a control unit 125 for controlling 
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external devices. The use of the microprocessor 128 allows rapid, easy programming of 
the nodes 1 10 to execute protocols and enables monitoring of the energy dissipation 
required for the various functions performed within the protocol. The microprocessor 
128 can also place the nodes 110 into a high energy, a low energy or a sleep state. The 
5 RF transceiver module 129 dissipates approximately 50 nJ/bit to 1 000 nJ/bit to run the 
transmitter or receiver circuitry. Low energy radios are described in an article entitled 
"Energy-Efficient Communication Protocol for Wireless Microsensor Networks," W. 
Rabiner Heinzelman, A. Chandrakasan, and H. Balakrishnan, Proceedings of the 33^^ 
International Conference on System Sciences (HICSS '00), January 2000. 

10 

The signal strength processor 123 is used in the cluster 1 12 formation process to 
determine the cluster-head 110,,, that requires the minimum communication energy for 
the non-cluster-head node 1 \%,^^^,. It should be appreciated that other indirect means for 
determining communication energy, for example, a global positioning system (GPS) 
1 5 which can determine inter-nodal distances, can be used in the cluster 1 1 2 formation 
process. One of ordinary skill in the art would recognize that various types of sensors 
126 and control interfaces including digital input and output modules, and digital to 
analog converters (DACs) could be interfaced to microprocessor and that a node could 
have a range of power requirements. 

20 

The node 110 also includes a cluster-head selector processorl28b and a cluster 
selector processor 128a. The detailed operations of cluster-head selector processor 128b 
and cluster selector processor 128a will be described below in conjunction with FIG. 4. 
Suffice it here to say that cluster-head selector processor 128b determines when the node 

25 will elect or designate itself as a cluster-head and that cluster selector processor 128a 

determines the cluster-head with which the node will communicate once it is determined 
that the node is a non-cluster-head node 1 10Non.eh- The cluster and cluster-head selection 
processors may be implemented as individual circuits or processors or may be provided 
as part of microprocessor 128 (i.e., microprocessor 128 would perform the functions of 

30 the cluster and cluster-head selection circuits). The blocks denoted "processors" can 
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represent computer software instructions or groups of instructions. Alternatively, the 
processing blocks represent steps performed by functionally equivalent circuits such as a 
digital signal processor circuit or an application specific integrated circuit (ASIC). 

5 It should be appreciated that since the functions performed by the cluster-head are 

swapped or rotated amongst each of the nodes 100, then the cluster and cluster-head 
selector processor 128b in each node must periodically operate to determine whether they 
should become a cluster-head and if not, to determine which cluster they should join, 

1 0 In operation, the system communication over time is divided by the protocol into 

rounds 130 as shown in FIG. 3. Each round 130 begins with a set-up phase 132 when the 
clusters 100 are formed, followed by a steady-state phase 133 in which several frames 
134 of data are transferred from the non-cluster-head nodes 1 lO^on-chs the respective 
cluster-heads 1 10a,,, and on to the base station 102. In order to minimize overhead, the 

15 steady-state phase 133 is long compared to the set-up phase 132. Using cluster 1 12a as an 
example, in each frame 134, each node 1 lOa-1 10c, 1 lOe and 1 1 Of is assigned a time slot 
to transmit data collected from each associated sensor 126 (as shown in FIG, 2) to the 
cluster-head 1 lOd.i,. The nodes 110 must all be time- synchronized in order to start the 
set-up phase 132 at the same time. The nodes 1 10 are preferably synchronized by the 

20 base station 1 02 transmitting synchronizing signals to the cluster-heads 1 1 0,i,, which in 
turn synchronize the non-cluster-head nodes 1 10^,,,.,i,,. It should be appreciated that other 
methods can be used to synchronize the nodes 110 such as a signal from a global time 
source. Thus, each cluster 1 12a - 1 12c begins each round 130 at the same time. 

25 Routing and medium access in the system are preferably selected to minimize 

energy and latency by exploiting application-specific information. High-energy data 
transfers in the system 100 are scheduled during each round 130 by the then designated 
cluster-heads 1 1 Od,i,, 1 1 Oh,i, and 1 1 Om,,, to ensure no collisions in the data transmissions 
and to allow non-cluster-head nodes 1 lO^on-chs to remain in the sleep state as long as 

30 possible. The medium access during all phases of the system is preferably selected to 
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minimize collisions and maximize sleep time. Including a sleep state is one method to 
lower the medium access energy requirements. Additionally adjustable power levels and 
other techniques known in the art can be used to conserve energy during operation. 

5 The system 100 also facilitates local processing of the data at each of the cluster- 

heads 1 lOchs- Data aggregation techniques can be used to reduce the amount of data that 
needs to be transmitted from each of the cluster-heads 1 10^,^ to the base station 102, 
greatly minimizing the energy dissipated in the cluster-heads 1 10,,,^. 

10 In a preferred embodiment, the system forms clusters 1 1 2a- 1 1 2c by using a 

distributed algoritlrm, where nodes 110 make autonomous decisions without any 
centralized control. The advantages of this approach are that no long-distance 
communication with the base station 102 is required and distributed cluster formation can 
be accomplished without knowing the exact location of any of the nodes 1 10 in the 

15 network 100. In addition, no global communication among the nodes 1 10 is necessary to 
set up the clusters 1 12. Rather, clusters 1 12 are formed out of the nodes 110, purely via 
local decisions made autonomously by each node 110. Starting after an initial round 130 
of operation denoted round r, some of the nodes 110 elect themselves to be cluster-heads 
110,,, at the beginning of round 130 ( r+1) (which starts at time t, denoted by reference 

20 number 135) with a certain probability, P, (t). This probability is chosen such that the 
expected number, E, of cluster-heads 1 1 0,^, for this round 1 30 is k, a network parameter. 
Thus: 

E[# CH] = P,(t)* 1 = k (Equation 1) 

where N is the total nuraber of nodes 1 10 in the network. The parameter K is the average 
25 number of clusters and is preferably a parameter set by the system administrator when the 
network 100 is configured. In another embodiment the parameter K could be changed 
dynamically by the base station 102 or calculated automatically. The calculation of the 
optimum number of clusters is described in ''Application-Specific Protocol Architectures 
for Wireless Networks " W. Heinzelman, PhD Thesis, Massachusetts Institute of 
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Technology, June 2000. 

In order to maximize system lifetime, all nodes 110 preferably dissipate an equal 
amount of energy. This requires an approach, in which, no individual node 1 10 is being 
5 overly utilized compared to the other nodes 110. Because being a cluster-head is 
significantly more energy-intensive than being a non-cluster-head node 1 lOi^T^ivch? this 
requirement translates to having the nodes 1 10 be cluster-heads 1 lO^i,, the same number 
of times, assuming all nodes 110 begin with approximately the same amount of energy. 
If k nodes 1 10 are cluster-heads 1 lO,,,^ during each round 130, each node 110 should be a 
10 cluster-head 1 10,i, once in N/k rounds. Combining these constraints, provides the 
following probability for each node 110 (node i ) to be a cluster-head 1 10,i, at time t: 



where r is the number of rounds 130 that have been completed and C,(t) = 0 if node i 110 
1 5 has already been a cluster-head 1 1 0 in the most recent (r mod N/k) rounds and 1 

otherwise. Therefore, only nodes 110 that have not already been cluster-heads 110 ^i^, and 
hence have more energy available than nodes 110 that have performed this energy- 
intensive function, may become cluster-heads 1 10,i,, at round 130 r+1 . There will be 
approximately (N-k)*r nodes 1 10 that have not been cluster-heads 1 10,,,, in the first r 
20 rounds 130. After all nodes 110 have been cluster-heads 1 lO.^,, (which occurs in N/k 
rounds), all nodes 1 10 are again eligible to become cluster-heads 1 1 0,1,3. Therefore, the 
term N-k'^(r mod N/k) represents the total number of nodes 110 that are eligible to be a 
cluster-head 110,,, during round 130 (r+1), and 



25 This ensures that the energy at each node 1 10 is approximately equal after every N/k 

rounds 130. Using Equations (2) and (3), the expected number of cluster-heads 110,,,, per 
round 130 is: 




C.(t) = l 
C,(t) = 0 



(Equation 2) 



^ Ci(t) = N-k*(r mod N/lc) 



(Equation 3) 



(=1 
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E[#CH]-X PXt)n 

- (N-k^(r mod N/k)) ^ Ic/ (N-k^(r mod N/k)) 

= k (Equation 4) 

5 

This choice of probabiUty for becoming a cluster-head llO^i^ assumes that all 
nodes 110 begin with an equal amount of energy. If each node 110 begins with different 
amounts of energy, the nodes 110 with more energy will be cluster-heads 1 lO^^^^ more 
often than the nodes 110 with less energy, in order to ensure that all nodes 1 10 die at 
10 approximately the same time. In this case, the probability of becoming a cluster-head 
1 10,h depends on a node^s 1 10 energy level relative to the aggregate energy remaining in 
the network 10, rather than purely the number of times the node 1 10 has been a cluster- 
head 110^,,: 

1 5 P,(t) = (E.(t)/E,,,,(t))nc (Equation 5) 

where E- (t) is the current energy of node i, and 

E.otai(t)-S E^t) (Equation 6) 

/-I 

Using these probabilities, the nodes 110 with higher energy are more likely to become 
cluster-heads 1 10,,, than nodes 110 with less energy. The expected number of cluster- 
20 headsl 10,1,5 is: 

E[#CH]= x P^tri 

/-I 

-(E,(t)/E,,, + ..,+E^(t)/E,,Jk 

= k (Equation 7) 

25 Equation 5 can be approximated by Equation 2 when the nodes 1 10 begin with 

equal energy, E,. If a node 1 10 has been a cluster-head 1 lOA.^ in the last r < N/k rounds 
130, its energy is approximately E^ - E^h, where Ech is a large number less than E^. If the 
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node has not been a cluster-head 1 10 A,,, in the last r rounds 130, its energy is 
approximately E„, since being a non-cluster-head node 1 lONon-ci, does not require much 
energy from the node 110. Since k*r nodes 110 have been cluster-heads 1 10,,,, and (N- 
k)*r nodes 1 10 have not been cluster-heads 1 10,^^ in the last r rounds 130, the total energy 
is given by: 

E,„„ = E„(N-kr) + (E„-EcH)(kr) (Equation 8) 

Therefore, Equation 5 becomes: 

Eok 



Eo(N-kr) + (Eo-EcH)kr 

C,(t) = l 

P,(t)= I (Eo-EcH)k C(t)-0 ^^'^'^^^^'''^^^ 



Eo(N - kr) + (Eo - EcH)lcr 
1 0 Since E„ » (E„-Ech), this can be simplified to: 



{ 



k 



V(t) ^ 1 ^'"^^ ^'^^^ ^ (Equation 10) 



This is exactly the same equation as Equation 2 (for r < N/k), Using the probabilities in 
Equation 5 requires that each node 110 have an estimate of the total energy of all the 
1 5 nodes 1 10 in the network, and hence requires a routing protocol that allows this to be 
computed, whereas the probabilities in Equation 2 enable each node 1 10 to make 
completely autonomous decisions. An alternate approach to avoid the routing protocol 
might be to approximate the aggregate network energy by averaging the energy of the 
nodes 1 10 in each cluster 100 and multiplying by N, 

20 

With reference to FIG. 4, at step 500, nodes 110 determine if they have elected 
themselves to be local cluster-heads 1 10,i,, at any given time with a certain probability as 
given in Equations 2 and 5. At step 510, these cluster-heads 1 10,i,, broadcast their status 
to the other nodes 1 10 in the network. At step 550, each node 110 chooses the cluster- 
25 head 1 10^,, with which it wants to communicate. 
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In a preferred embodiment, each non-cluster-head node 1 lO^^,,.,!, measures the 
signal strength of the cluster-head status announcements transmitted by the cluster-heads 
1 10,1,3. Each non-cluster-head node 1 lO^,,,.,!, compares the signal strengths of received 
5 status messages and selects the cluster-head 1 10,i, that requires the minimum 

communication energy for the non-cluster-head node 1 lO^on-dr By selecting a cluster- 
head 110,;,, the non-cluster-head node 1 10^,„.,h also selects the cluster 1 12 to which it will 
belong for that round 130. It should be appreciated that although the cluster-head 1 10,i, 
with the strongest signal (relative to the selecting node 110) would typically be chosen by 
1 0 each non-cluster-head node 1 1 O^^^.,,, other criteria such as environmental factors and 
geography can be used in selecting the cluster 1 12 to be joined for each round 130. 

Once all the nodes 1 10 are organized into clusters 1 12, each of the cluster-heads 
1 10,1,3 creates a schedule, step 530, for the nodes 1 10 in its cluster. This allows the radio 

1 5 components of each non-cluster-head node 1 1 O^on^ch to be turned off at all times except 
during its transmit time, thus minimizing the energy dissipated in the individual nodes 
110. Once the cluster-head 1 10,i, has all the data from the nodes 1 10 in its cluster 1 12, 
the cluster-head 1 10,i, aggregates the data and then transmits the aggregated data to the 
base station 102. Since the base station 102 can be located far away this could be a high 

20 energy transmission. 

Since there are relatively few cluster-heads 1 10,^,, compared with the total number 
of nodes 110, this only affects a small number (K) of nodes 1 10. As discussed above, 
being a cluster-head 1 lO,,, drains that node's 110 battery more heavily. In order to spread 
25 this energy usage over multiple nodes 110, the cluster-heads 1 10,1,^ are not fixed; rather, 
this position is self-elected at different time intervals. Thus a set C of nodes 110 might 
elect themselves cluster-heads at time tl , but at time tl + d a new set C of nodes elect 
themselves as cluster-heads 

30 In FIG, IB, for example, nodes 1 lOd.^^^ 1 10h,„ and 1 lOm,^, are shown as cluster- 
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heads for a particular transmission round. In a subsequent transmission round, however, 
nodes 1 lOe, 1 lOg and 1 lOi can elect themselves as cluster-heads and nodes llOd, 1 lOh 
and 110m are non-cluster-head nodes 110Non.chs- 

5 Furthermore, the clusters themselves may include different nodes and a different 

number of clusters can be used for a given round of communications. For example, 
referring briefly to FIG. IB, if node 1 lOe is a cluster-head, cluster 112a' may be 
comprised of nodes 11 Ob, 1 lOd, 1 101 and 1 lOr. Similarly, if node llOg is a cluster-head, 
then cluster 1 12b' may be comprised of nodes 1 lOa, 1 10c, 11 Of, 1 1 Oh and 1 lOj. 

10 

While the distributed algorithm for determining cluster-heads 1 lO.^^ ensures that 
the expected number of clusters 100 per round is k (a configuration parameter), it does 
not guarantee that there are k clusters at each round. In addition, the set-up protocol does 
not guarantee that nodes are evenly distributed among the cluster-heads. Therefore, the 

1 5 number of nodes per cluster is highly variable in LEACH, and the amount of data each 
node can send to the cluster-head 1 10,,, varies depending on the number of nodes in the 
cluster. It is possible that a smaller cluster 1 12 can have more data transmission cycles in 
a round 1 30 than a larger cluster 112. The decision to become a cluster-head 1 1 0,,, 
depends at least in part on the amount of energy left at the node 110. In this way, nodes 

20 110 with more energy remaining will perform the energy-intensive functions of the 
network 10. 

With reference again to FIG. 4, once the nodes 110 have elected themselves at 
step 500 to be cluster-heads 1 lO,,, using the probabilities in Equation 2 or 5, the cluster- 

25 heads 1 1 0,,,^ must let all the other nodes 1 1 0 in the network 1 0 know that they have 

chosen this role for the current round 130. To do this, each cluster-head 1 10,,, broadcasts 
an advertisement message (ADV) at step 510 preferably using a carrier-sense multiple 
access (CSMA) Media Access Control (MAC) protocol. It should be appreciated that 
other protocols may be used. This message is a small message containing an identified 

30 (ID) of each node 1 1 0 and a header that distinguishes this message as an announcement 
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message. This message must be broadcast to reach all of the nodes 110 in the network 
10. The reason for this is two-fold. First, ensuring that all nodes 110 receive the 
advertisement essentially eliminates collisions when carrier-sense is used, since there is 
no hidden terminal problem. Second, since there is no guarantee that the nodes 1 10 that 

5 elect themselves to be cluster-heads 1 1 0,,, are spread evenly throughout the network 1 0, 
using enough power to reach all nodes 1 10 ensures that every node 1 10 can become part 
of a cluster 100. If the power of the advertisement messages was reduced, some nodes 
1 10 on the edge of the network may not receive any announcements and therefore may 
not be able to participate in this round 130 of the protocol. Since these advertisement 

1 0 messages are small, the increased power to reach all nodes 1 1 0 in the network 1 0 is not a 
burden. 

If at step 510, no cluster-heads 110,,, are elected, the base station 102 can 
preferably communicate with the nodes 110 for the current round 130. Alternatively, the 
15 base station 1 02 can signal the nodes 1 1 0 to reform the clusters 1 1 2 by returning to step 
500. 

At step 540, each non-cluster-head node 1 lO^on-ch determines to which cluster it 
wants to belong by choosing the cluster-head 1 10,,, that requires the minimum 

20 communication energy, based on the received signal strength of the advertisement from 
each cluster-head 1 10,,,. Assuming symmetric propagation chaimels for pure signal 
strength, the cluster-head 110 advertisement heard with the largest signal strength is the 
cluster-head 1 10,^ to whom the minimum amount of transmitted energy is needed for 
communication. This will typically be the cluster-head 110,,, closest to the sensor node 

25 110. However, if there is some obstacle impeding the communication between two 
physically close nodes 110 (e.g., a building, a tree, etc.) such that communication with 
another cluster-head 1 10,,,, located further away, is easier, the sensor will choose the 
cluster-head that is spatially further away but "closer" in a communication sense. In the 
case of ties (e.g., two cluster-heads require same amount of energy to reach a non-cluster- 

30 head node 1 1 O^^n-aX a random function is used to select the cluster-head 1 1 0,,, with which 
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the node will communicate. 

After each node 1 10 has decided to which cluster 112 it belongs, it must inform 
the cluster-head 1 lO^i, of that cluster that it will be a member of that cluster. At step 550, 

5 each node 1 10 transmits a join-request message (Join-REQ) back to the chosen cluster- 
head 1 10^1, using CSMA. This message is again a short message, consisting of the ID of 
the node 1 10, the ID of the cluster-head 110,},, and a header. Since each nodes 1 10 has an 
indication of the relative power needed to reach the cluster-head 110,,, (based on the 
received power of the advertisement message), it could adjust its transmit power to this 

1 0 level. However, this approach suffers from the hidden-terminal problem; if a node 1 1 0 
close to the cluster-head 1 10,i, is currently transmitting a join-request message using low- 
power, the remaining nodes 1 10 in the cluster 1 12 cannot sense that this transmission is 
occurring and may decide to transmit their own join-request messages. Since these 
messages are small, it is more energy-efficient to increase the transmit power of the join- 

1 5 request messages than to use an IEEE 802. 1 1 protocol approach of transmitting request to 
send - clear to send (RTS-CTS) messages. Since the cluster-head 1 10,h does not know 
the size of its cluster 1 12, it would need to transmit the CTS message using large power 
to reach all potential cluster member, nodes 110. In addition, simply increasing the 
transmit power reduces the latency and increases the sleep time allowed for all the nodes 

20 110 compared with an RTS-CTS approach. Therefore, the nodes use a large amount of 
power for transmissions of the short join-request messages to the cluster-heads HO,,,,. 

The cluster-heads 1 lO^i^^act as local control centers to coordinate the data 
transmissions in their respective clusters 1 12. In step 530, the cluster-head sets up a Time 

25 Division Multiple Access (TDMA) schedule and transmits this schedule to the nodes 110 
in the cluster 1 12. This ensures that there are no collisions among data messages and also 
allows the radio components of each non-cluster-head node 1 10^o„.,i, to be turned off at all 
times except during their transmit time, thus minimizing the energy dissipated by the 
individual sensors. After the TDMA schedule is known by all nodes 1 10 in the cluster 

30 1 12, the set-up phase is complete and the steady-state operation (data transmission) can 
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begin. It should be appreciated that other protocols such as Frequency Division 
Multiplexing may be used to collect data. 

FIGS. 5 A and 5B show dynamic cluster formation during two different rounds 
5 130 of LEACH. All nodes 110 marked with a given symbol (e.g., circle, x, square, 
triangle, cross) belong to the same cluster 1 12 for a given round of operation, and the 
cluster-heads are designated as 1 10,i,,. A cluster 1 12 can contain a single node 1 10 acting 
as the cluster-head 1 10,i,. The clusters 1 12 can vary in shape and size from one round 130 
(r) to the next round 130 (r+l). The actual number of clusters 1 12 can vary from the 
1 0 expected number of clusters 1 1 2 in any given round. 

FIG. 6 illustrates the operational steps during the steady-state phase. After the 
cluster set-up step 600, the non-cluster-head nodes 1 lO^^,.,!,, are determined in step 610 
and these nodes 110 send their data to the cluster-head 1 10,h at most once per frame 134 

1 5 (shown in FIG. 3) during their allocated transmission time. This transmission uses an 
amount of energy determined from the received strength of the cluster-head 1 10,j, 
advertisement. In step 650, the radio of each non-cluster-head node 1 lO^^,,.,!, is turned off 
until its allocated transmission time, thus reducing the energy dissipated in these nodes. 
Decision block 660 implements a loop in which steps 600-650 are repeated until enough 

20 time has elapsed for the round to end. The duration of a round 130 can be set as a 

parameter by the system administrator or can be controlled by the base station 102 as a 
function of the energy distribution in the nodes 110. Since all the nodes 110 usually have 
data to send to the cluster-head 1 10,i, and the total bandwidth is fixed, using a TDMA 
schedule is efficient use of bandwidth and represents a low-latency approach, in addition 

25 to being energy-efficient. 

Once the decision is made in decision block 660 that the amount of time for this 
round has elapsed, then nodes 110 send their data during the scheduled time as shown in 
step 670 and then return to the sleep mode, step 680, until the next frame 134. 



30 
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If in decision block 610, a decision is made that the node is the cluster-head, then 
processing flows to step 620. In decision block 620, if a decision is made that not enough 
time has elapsed, then processing returns to step 600 and steps 600-620 are repeated. If 
in decision block 620, a decision is made that sufficient time has elapsed for the round to 
5 end, then processing proceeds to step 630. The cluster-head 1 10,|, must keep its receiver 
on during the round to receive all the data from the nodes 110 in the cluster 112. As 
shown in step 630, after the cluster-head 1 lO.^, receives all the data for a frame 134, it can 
operate on data step 640 (e.g., performing data aggregation, as discussed below) and then 
send the resultant data (or the original data) from the cluster-heads 1 10,j,, to the base 
10 station 102. Since the base station 102 may be far away and the data message is large, 
this is a high-energy transmission. 

FIG. 7 shows a time-line operation of network 100 operation. Data transmissions 
are explicitly scheduled to avoid collisions and increase the amount of time each non- 
1 5 cluster-head node 1 IOt^t^,.,!, can remain in the sleep state. 

The MAC and routing protocols ensure low-energy dissipation in the nodes 1 10 
and no collisions of data messages within a cluster 100. However, radio is inherently a 
broadcast medium. As such, transmission in one cluster (e.g., cluster 1 12a in FIG. 5A) 

20 will affect (and often degrade) communication in a nearby cluster (e.g., cluster 1 12b in 
FIG. 5 A). To reduce inter-cluster interference, each cluster 100 in LEACH 
communicates using a unique orthogonal spreading code, as in a CDMA network. Thus, 
when a node 110 decides to become a cluster-head 1 10,,,, it chooses randomly from a list 
of spreading codes. It informs all the nodes 1 10 in the cluster to transmit using this 

25 spreading code and filters all received energy using this spreading code. As in a cellular 
network , the interference comes from the nodes 1 10 in surrounding clusters 112. To 
reduce the possibility of interfering with nearby clusters 100 and reduce its own energy 
dissipation, each node 1 10 adjusts its transmit power to minimize interference. 
Therefore, little spreading of the data is actually needed to ensure a low probability of 

30 collision. 
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In a large sensor network 100, it would be difficult for an end-user to examine all 
the data from each sensor node 1 10 in the network. Therefore, each cluster-head can 
preferably utilize automated methods of combining or aggregating the data into a small 

5 set of meaningful information using techniques known in the art. In addition to helping 
avoid information overload, data aggregation, or data fusion, can combine several 
unreliable data measurements to produce a more accurate signal by enhancing the 
common signal and reducing the uncorrected noise. One method of aggregating data is 
beamforming. Beamforming combines signals from multiple sensor nodes 1 10 in order 

10 to satisfy an optimization criteria, such as minimizing mean squared error (MSE) or 
maximizing signal-to-noise ratio (SNR). Various algorithms such as the least mean 
squared (LMS) error approach and the maximum power beamforming algorithm have 
been developed to beamform signals. These algorithms have various energy and quality 
tradeoffs. For example, the maximum power beamforming algorithm is capable of 

1 5 performing blind beamforming, requiring no information about the sensor node 1 1 0 
locations. However, this algorithm is compute-intensive, which will quickly drain the 
limited energy of the node. 

Data aggregation can be performed at the cluster-heads 110 

chs network 1 00. If 

20 the energy for communication is greater than the energy for computation, performing the 
data aggregation algorithm locally at the cluster-head 11 0,^ can greatly reduce the overall 
system energy, since much less data needs to be transmitted to the base station 102. This 
will allow large computation versus communication energy gains with little to no loss in 
overall network quality. 

25 

In order for the cluster-heads 1 10,^, to perform data aggregation to compress the 
data into a single signal, data from the different nodes 1 10 in the cluster must be 
correlated. Because it is difficuk to determine exact correlation, a data-independent 
model determines the amount of correlation that exists between the data from different 
30 sensor nodes 110. The model is based on the assumption that the source signal travels a 
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distance before it can no longer be reliably detected by the sensors 126, and that the 
sensors 126 are omnidirectional (e.g., acoustic, seismic sensors). 

Now referring to FIG. 8A, this implies that the maximum distance between 
5 sensors 126 with correlated data is 2p covering area 190. However, being within 2p of 
each other does not guarantee that the two sensors 126 will detect the same signal as 
shown in FIG. 8B. 

FIG. 8C shows the minimum amount of overlap when all nodes 1 10 in a cluster 
1 0 are separated by at most distance d and their view of the world has a radius p. From FIG. 
8C, it is seen that the total amount of overlap, 0, in the nodes' 110 view of source signals 
is at least: 



0>7r 

1 5 Therefore, the fraction of overlap /is 



(Equation 11) 



(Equation 12) 



If d is written as a fraction x of p, d = xp, then the amount of overlap simplifies to 



/Ml 



(Equation 13) 



20 If the system requires that there be over 50% overlap, this means that the maximum 
distance between nodes in the cluster is: 



,--|>0.5 



X < 2 



1- 



1 



4i. 

X < 0.5858 



(Equation 14) 
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or d < 0.6p. 

Referring to FIG. 8C, to ensure at least 50% overlap in the data to employ the 
local data aggregation scheme discussed above, the maximum distance between any two 
5 nodes whose signals are beamformed must be approximately 0.6p. If the maximum 

distance is greater than 0.6p, the cluster-head must perform separate data aggregations to 
ensure high quality signals. 

In an alternate embodiment of the cluster formation process, when clusters 100 
10 need to be formed (e.g., at the beginning of each round 130), each node 110 sends some 
information to base station 102 and base station 102 executes an optimization algorithm 
to determine the optimal clusters 1 12 for that round 130, The clusters 112 formed by the 
base station 102 will in general be better than those formed using the distributed 
algorithm described above. However, this approach requires that each node 110 transmit 
1 5 information to the base station 1 02 at the beginning of each round 130 about its location. 
This may be achieved by using a GPS receiver (not shown) that is activated at the 
beginning of each round 130 to get the current location of each node 110. In addition, 
each node 110 must send its current energy level to the base station 102. 

20 In this base station cluster formation process, the base station 102 computes the 

average node 110 energy, and whichever nodes 110 have energy below this average 
cannot be a cluster-head for the current round 130. Using the remaining nodes 1 10 as 
possible cluster-heads 110,},,, the base station 102 performs a simulated annealing 
algorithm as is known in the art, to determine the best k nodes 1 10 to be cluster-heads 

25 1 10,5,, foi* ^^^t ^^^^^ 130 and the associated clusters 100. This approach minimizes 
the amount of energy the non-cluster-head nodes 1 lO^^^..^.^ will have to use to transmit 
their data to the cluster-head 1 10,,,, by minimizing the total sum of squared distances 
between all the non-cluster-head nodes 1 lO^^^^.,^, and the closest cluster-head 110,,,. 
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In one embodiment, the base station 102 can use the simulated annealing 
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algorithm, given the current optimum set of cluster-heads 110, C, a new set of cluster- 
heads 110' C will become optimal at iteration k with probability 

-(f(C )-f(C))/ (lOOOe-^'"^*^ 

e : f(CO>f(C) , ^.^ 

rfk ^ ( (Equation 15) 

^ ' : f(C)<f(C) 

5 where /(C) represents the cost function defined by 

f{C) - y min (/,c) (Equation 16) 

where d(i,c) is the distance between node 1 lOi and node 1 10c. In this case the nodes 110 
having above average energy are more likely to become cluster heads 1 lO^j^^ • This 
algorithm typically converges in 200-500 iterations for a network which includes one 
1 0 hundred nodes 110. Since these computations are being performed at the base station 
102, energy dissipation is not a concern. It should be appreciated that other algorithms 
which minimize the cost function/C) can be used for cluster allocation. 

Once the optimal cluster-heads 1 lO^,,^ and associated clusters 1 12 are found, the 
1 5 base station 1 02 transmits this information back to all of the nodes 1 1 0 in the network. 
This is done by broadcasting a message that contains the cluster-head ID for each node 
110. If a node's cluster-head ID matches if s own ID, that node 110 takes on the cluster- 
head 1 10,1, ^ole; otherwise, the node 110 determines its TDMA slot for data transmission 
and goes to sleep until it is time to transmit data to its cluster-head 1 lOA^j,. The steady- 
20 state phase in this approach is identical to that shown in FIG. 5. 

In another alternate embodiment, an "event-driven" protocol is implemented so 
that sensors nodes 110 only transmit data if some event occurs in the environment 
thereby further reducing the amount of data transferred to the cluster-head 1 lO,,,, and to 
25 the base station 102. 



All publications and references cited herein are expressly incorporated herein by 
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reference in their entirety. 

Having described the preferred embodiments of the invention, it will now become 
apparent to one of ordinary skill in the art that other embodiments incorporating their 
5 concepts may be used. It is felt therefore that these embodiments should not be limited to 
disclosed embodiments but rather should be limited only by the spirit and scope of the 
appended claims. 

What is claimed is: 



